We have a system using SolarWinds that runs health checks on our app every five minutes.
When we get a “Down” event, I’d like to trigger a PagerDuty alert. We have all that set up and working, but we get occasional false alerts that resolve on the next check (five minutes later).
I’d like to set up the notifications so that a “Down” event triggers the alert but does not start the call chain for 10 minutes. If the “Up” email arrives within that window, we’d auto-resolve the event, and no one has to wake up.
How can we get that to happen? I can’t figure out how to insert a step in the escalation policy to “do nothing for ten minutes.”
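
To make the behavior I'm after concrete, here is a rough sketch of the hold logic in plain Python. It is only an illustration of the timing rule, not anything PagerDuty or SolarWinds provides: the `HoldQueue` class, the `on_event` and `due` methods, and the check name `"app"` are all made up for this example, and it does not call any real API.

```python
from dataclasses import dataclass, field

HOLD_SECONDS = 10 * 60  # grace period before anyone is paged


@dataclass
class HoldQueue:
    """Hypothetical helper: tracks Down events not yet paged or cleared."""

    # Maps a check name to the time (in seconds) of its first Down event.
    pending: dict = field(default_factory=dict)

    def on_event(self, check: str, status: str, now: float) -> None:
        if status == "Down":
            # Keep the earliest Down time so repeated Down events
            # do not restart the hold period.
            self.pending.setdefault(check, now)
        elif status == "Up":
            # Recovery inside the hold window: drop the alert quietly.
            self.pending.pop(check, None)

    def due(self, now: float) -> list:
        """Return checks Down for the full hold period, removing them."""
        ready = [c for c, t in self.pending.items() if now - t >= HOLD_SECONDS]
        for c in ready:
            del self.pending[c]
        return ready


queue = HoldQueue()
queue.on_event("app", "Down", now=0)
queue.on_event("app", "Up", now=300)    # recovered on the next check
print(queue.due(now=600))               # nothing to page

queue.on_event("app", "Down", now=1000)
print(queue.due(now=1600))              # still down after 10 minutes
```

In other words, the call chain should only start for whatever `due` returns; anything that gets an “Up” first should never reach a person.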